Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 10353 |
| Missing cells | 16072 |
| Missing cells (%) | 8.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.4 MiB |
| Average record size in memory | 542.2 B |
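The missing-cell percentage above can be re-derived from the other rows of the table. The sketch below is only an arithmetic check on the reported figures; the variable names are illustrative and it does not touch the underlying data.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 18
n_observations = 10353
missing_cells = 16072

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells:   {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```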
Variable types
| Categorical | 7 |
|---|---|
| Numeric | 11 |
Loan ID has a high cardinality: 10000 distinct values | High cardinality |
Customer ID has a high cardinality: 10000 distinct values | High cardinality |
Annual Income is highly correlated with Monthly Debt | High correlation |
Monthly Debt is highly correlated with Annual Income and 1 other field | High correlation |
Number of Open Accounts is highly correlated with Maximum Open Credit | High correlation |
Number of Credit Problems is highly correlated with Bankruptcies | High correlation |
Current Credit Balance is highly correlated with Monthly Debt and 1 other field | High correlation |
Maximum Open Credit is highly correlated with Number of Open Accounts and 1 other field | High correlation |
Bankruptcies is highly correlated with Number of Credit Problems | High correlation |
Monthly Debt is highly correlated with Annual Income | High correlation |
Number of Credit Problems is highly correlated with Bankruptcies and 1 other field | High correlation |
Tax Liens is highly correlated with Number of Credit Problems | High correlation |
Current Credit Balance is highly correlated with Maximum Open Credit | High correlation |
Maximum Open Credit is highly correlated with Current Credit Balance | High correlation |
Home Ownership is highly correlated with Purpose | High correlation |
Purpose is highly correlated with Home Ownership | High correlation |
Loan ID has 353 (3.4%) missing values | Missing |
Customer ID has 353 (3.4%) missing values | Missing |
Current Loan Amount has 353 (3.4%) missing values | Missing |
Term has 353 (3.4%) missing values | Missing |
Credit Score has 2334 (22.5%) missing values | Missing |
Annual Income has 2334 (22.5%) missing values | Missing |
Years in current job has 780 (7.5%) missing values | Missing |
Home Ownership has 353 (3.4%) missing values | Missing |
Purpose has 353 (3.4%) missing values | Missing |
Monthly Debt has 353 (3.4%) missing values | Missing |
Years of Credit History has 353 (3.4%) missing values | Missing |
Months since last delinquent has 5659 (54.7%) missing values | Missing |
Number of Open Accounts has 353 (3.4%) missing values | Missing |
Number of Credit Problems has 353 (3.4%) missing values | Missing |
Current Credit Balance has 353 (3.4%) missing values | Missing |
Maximum Open Credit has 353 (3.4%) missing values | Missing |
Bankruptcies has 375 (3.6%) missing values | Missing |
Tax Liens has 354 (3.4%) missing values | Missing |
Maximum Open Credit is highly skewed (γ1 = 51.4146651) | Skewed |
Loan ID is uniformly distributed | Uniform |
Customer ID is uniformly distributed | Uniform |
Number of Credit Problems has 8653 (83.6%) zeros | Zeros |
Tax Liens has 9810 (94.8%) zeros | Zeros |
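The per-variable missing counts in the alerts above add up to the 16072 missing cells reported in the dataset statistics. Twelve variables share the same count of 353, which also equals 10353 observations minus the 10000 distinct IDs. One reading, offered here as a hypothesis rather than something the report states, is that 353 rows are empty across all columns. The sketch below checks the total and shows how many missing values each variable has beyond that shared baseline.

```python
# Missing counts copied from the "Missing" alerts.
missing = {
    "Loan ID": 353,
    "Customer ID": 353,
    "Current Loan Amount": 353,
    "Term": 353,
    "Credit Score": 2334,
    "Annual Income": 2334,
    "Years in current job": 780,
    "Home Ownership": 353,
    "Purpose": 353,
    "Monthly Debt": 353,
    "Years of Credit History": 353,
    "Months since last delinquent": 5659,
    "Number of Open Accounts": 353,
    "Number of Credit Problems": 353,
    "Current Credit Balance": 353,
    "Maximum Open Credit": 353,
    "Bankruptcies": 375,
    "Tax Liens": 354,
}

total_missing = sum(missing.values())

# Assumption: the smallest count is a baseline shared by every column.
baseline = min(missing.values())
extra = {name: n - baseline for name, n in missing.items() if n > baseline}

print(total_missing)
for name, n in extra.items():
    print(f"{name}: {n} missing beyond the shared {baseline}")
```

Confirming the hypothesis would require the raw data, for example by counting rows in which every column is null.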
Reproduction
| Analysis started | 2023-12-10 13:05:53.860926 |
|---|---|
| Analysis finished | 2023-12-10 13:06:23.550914 |
| Duration | 29.69 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
Loan ID
Categorical
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Memory size | 919.4 KiB |
| ed055a45-8724-44e3-9941-f571015e2c4a | 1 |
|---|---|
| b052ec5f-b849-44e0-8218-57c9bcebbe88 | 1 |
| ff3136a6-7227-44f0-81df-299b3ea41c33 | 1 |
| 48b259bd-a453-4c8e-8426-2a2a9a7a7775 | 1 |
| 3940bd5e-35ed-43e0-9af7-5db097149931 | 1 |
| Other values (9995) | 9995 |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 10000 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | f738779f-c726-40dc-92cf-689d73af533d |
|---|---|
| 2nd row | 6dcc0947-164d-476c-a1de-3ae7283dde0a |
| 3rd row | f7744d01-894b-49c3-8777-fc6431a2cff1 |
| 4th row | 83721ffb-b99a-4a0f-aea5-ef472a138b41 |
| 5th row | 08f3789f-5714-4b10-929d-e1527ab5e5a3 |
Common Values
| Value | Count | Frequency (%) |
| ed055a45-8724-44e3-9941-f571015e2c4a | 1 | < 0.1% |
| b052ec5f-b849-44e0-8218-57c9bcebbe88 | 1 | < 0.1% |
| ff3136a6-7227-44f0-81df-299b3ea41c33 | 1 | < 0.1% |
| 48b259bd-a453-4c8e-8426-2a2a9a7a7775 | 1 | < 0.1% |
| 3940bd5e-35ed-43e0-9af7-5db097149931 | 1 | < 0.1% |
| 2d399fa6-58f0-4888-99eb-5038d5bdc5c5 | 1 | < 0.1% |
| 98463866-41a0-4fd5-bbad-b1721be9c8f1 | 1 | < 0.1% |
| 71f9269f-adf4-4da8-af89-772e3041ee8d | 1 | < 0.1% |
| bc55e336-7fec-48c8-8c25-88c26ffddfe8 | 1 | < 0.1% |
| 1c7fcfe6-077d-43e6-807a-e169409430a7 | 1 | < 0.1% |
| Other values (9990) | 9990 | |
| (Missing) | 353 | 3.4% |
Length (histogram)
Words
| Value | Count | Frequency (%) |
| ed055a45-8724-44e3-9941-f571015e2c4a | 1 | < 0.1% |
| 0b2f1b66-741e-4e37-a929-99926cdc9e9a | 1 | < 0.1% |
| add946a5-20a5-4211-bf22-408525123b1d | 1 | < 0.1% |
| f7744d01-894b-49c3-8777-fc6431a2cff1 | 1 | < 0.1% |
| 83721ffb-b99a-4a0f-aea5-ef472a138b41 | 1 | < 0.1% |
| 08f3789f-5714-4b10-929d-e1527ab5e5a3 | 1 | < 0.1% |
| a4957169-d809-44cc-847b-975400bc8d11 | 1 | < 0.1% |
| 43467302-94fe-494b-b52f-3fd891fea71c | 1 | < 0.1% |
| 930c7cb3-6086-434a-9547-3ed41c181552 | 1 | < 0.1% |
| d08f3a5e-93df-40e7-bdd8-cba59180bddf | 1 | < 0.1% |
| Other values (9990) | 9990 |
Customer ID
Categorical
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Memory size | 919.4 KiB |
| 19af16fc-b2b2-44dd-8ce3-c5556b419d65 | 1 |
|---|---|
| 3bc77c02-e39e-4341-941b-9d4dc17b660e | 1 |
| fa6a4c11-7748-48cc-914c-2ea2ec84773e | 1 |
| 9328e4e2-8488-41aa-8ef5-869b74fed3fb | 1 |
| b4a93297-456f-4ba5-987a-a491c783e86b | 1 |
| Other values (9995) | 9995 |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 10000 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | ded0b3c3-6bf4-4091-8726-47039f2c1b90 |
|---|---|
| 2nd row | 1630e6e3-34e3-461a-8fda-09297d3140c8 |
| 3rd row | 2c60938b-ad2b-4702-804d-eeca43949c52 |
| 4th row | 12116614-2f3c-4d16-ad34-d92883718806 |
| 5th row | 39888105-fd5f-4023-860a-30a3e6f5ccb7 |
Common Values
| Value | Count | Frequency (%) |
| 19af16fc-b2b2-44dd-8ce3-c5556b419d65 | 1 | < 0.1% |
| 3bc77c02-e39e-4341-941b-9d4dc17b660e | 1 | < 0.1% |
| fa6a4c11-7748-48cc-914c-2ea2ec84773e | 1 | < 0.1% |
| 9328e4e2-8488-41aa-8ef5-869b74fed3fb | 1 | < 0.1% |
| b4a93297-456f-4ba5-987a-a491c783e86b | 1 | < 0.1% |
| 8f65b9c7-bfb4-46ed-b2cd-ab8f5bc32328 | 1 | < 0.1% |
| 3e407115-5bd7-43a2-a524-570beebdfd9e | 1 | < 0.1% |
| 95afe69c-2f56-4493-a6cc-a937078c1631 | 1 | < 0.1% |
| 6c5d2350-56e0-4101-a185-58cede981065 | 1 | < 0.1% |
| ebd746ba-91b4-4985-99f7-47e4999c5618 | 1 | < 0.1% |
| Other values (9990) | 9990 | |
| (Missing) | 353 | 3.4% |
Length (histogram)
Words
| Value | Count | Frequency (%) |
| 19af16fc-b2b2-44dd-8ce3-c5556b419d65 | 1 | < 0.1% |
| 6a1adeda-079b-49e5-ac7c-91828f2806a0 | 1 | < 0.1% |
| 163b8125-8f24-4b8f-ba59-23ea017f5b48 | 1 | < 0.1% |
| 2c60938b-ad2b-4702-804d-eeca43949c52 | 1 | < 0.1% |
| 12116614-2f3c-4d16-ad34-d92883718806 | 1 | < 0.1% |
| 39888105-fd5f-4023-860a-30a3e6f5ccb7 | 1 | < 0.1% |
| 6878d414-6a22-4712-ae43-9b3f798e463a | 1 | < 0.1% |
| 48113a98-a4a0-4956-b57d-f0ce344826fb | 1 | < 0.1% |
| 19941661-98e2-4800-93c9-a0e92057c813 | 1 | < 0.1% |
| 4080a828-a61a-4f04-a627-397f4319500c | 1 | < 0.1% |
| Other values (9990) | 9990 |
Current Loan Amount
Real number (ℝ≥0)
| Distinct | 6786 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11603801.21 |
| Minimum | 19470 |
|---|---|
| Maximum | 99999999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 19470 |
|---|---|
| 5-th percentile | 73036.7 |
| Q1 | 178948 |
| median | 309276 |
| Q3 | 515707.5 |
| 95-th percentile | 99999999 |
| Maximum | 99999999 |
| Range | 99980529 |
| Interquartile range (IQR) | 336759.5 |
Descriptive statistics
| Standard deviation | 31600097.14 |
|---|---|
| Coefficient of variation (CV) | 2.72325392 |
| Kurtosis | 3.956085669 |
| Mean | 11603801.21 |
| Median Absolute Deviation (MAD) | 144331 |
| Skewness | 2.440286167 |
| Sum | 1.160380121 × 10¹¹ |
| Variance | 9.985661393 × 10¹⁴ |
| Monotonicity | Not monotonic |
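Several rows in the quantile and descriptive statistics above are derived from other rows: the range from the minimum and maximum, the IQR from Q1 and Q3, the coefficient of variation from the standard deviation and mean, and the variance from the standard deviation. A minimal sketch of those relationships, using only the figures printed in the tables (variable names are illustrative):

```python
# Reported figures, copied from the tables above.
mean = 11603801.21
std = 31600097.14
q1, q3 = 178948, 515707.5
minimum, maximum = 19470, 99999999

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance

print(f"range    = {value_range}")
print(f"IQR      = {iqr}")
print(f"CV       = {cv:.8f}")
print(f"variance = {variance:.9e}")
```

The small differences in the last digits come from the inputs themselves being rounded in the report.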
| Value | Count | Frequency (%) |
| 99999999 | 1133 | 10.9% |
| 172436 | 7 | 0.1% |
| 154704 | 6 | 0.1% |
| 442596 | 6 | 0.1% |
| 221892 | 6 | 0.1% |
| 257554 | 5 | < 0.1% |
| 109472 | 5 | < 0.1% |
| 223036 | 5 | < 0.1% |
| 218834 | 5 | < 0.1% |
| 265188 | 5 | < 0.1% |
| Other values (6776) | 8817 | |
| (Missing) | 353 | 3.4% |
| Value | Count | Frequency (%) |
| 19470 | 1 | < 0.1% |
| 21472 | 1 | < 0.1% |
| 21516 | 1 | < 0.1% |
| 21560 | 2 | |
| 21604 | 3 | |
| 21626 | 1 | < 0.1% |
| 21670 | 1 | < 0.1% |
| 21692 | 1 | < 0.1% |
| 21890 | 2 | |
| 21956 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999999 | 1133 | |
| 789096 | 1 | < 0.1% |
| 789030 | 3 | < 0.1% |
| 788942 | 2 | < 0.1% |
| 788480 | 4 | < 0.1% |
| 788414 | 2 | < 0.1% |
| 788326 | 1 | < 0.1% |
| 788260 | 1 | < 0.1% |
| 788172 | 1 | < 0.1% |
| 788018 | 2 | < 0.1% |
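The value 99999999 appears 1133 times and is both the 95th percentile and the maximum, while the next largest value is 789096. That pattern suggests a placeholder rather than a real loan amount, although the report itself does not say so. Under that assumption, the sketch below estimates the mean without the placeholder, working only from the reported sum and counts.

```python
# Assumption: 99999999 is a placeholder code, not a genuine amount.
SENTINEL = 99_999_999

n_present = 10353 - 353          # observations minus missing values
n_sentinel = 1133                # rows holding the placeholder
reported_sum = 1.160380121e11    # "Sum" row (rounded in the report)

n_real = n_present - n_sentinel
mean_without_sentinel = (reported_sum - n_sentinel * SENTINEL) / n_real

print(f"rows with a real amount: {n_real}")
print(f"approximate mean without placeholder: {mean_without_sentinel:,.0f}")
```

The result lands close to the reported median of 309276, which is consistent with the placeholder being the main driver of the inflated mean and standard deviation. Because the reported sum is rounded, the estimate is approximate.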
Term
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Memory size | 662.8 KiB |
| Short Term | 7295 |
|---|---|
| Long Term | 2705 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.7295 |
| Min length | 9 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Short Term |
|---|---|
| 2nd row | Short Term |
| 3rd row | Short Term |
| 4th row | Short Term |
| 5th row | Short Term |
Common Values
| Value | Count | Frequency (%) |
| Short Term | 7295 | 70.5% |
| Long Term | 2705 | 26.1% |
| (Missing) | 353 | 3.4% |
Length (histogram)
Pie chart
Words
| Value | Count | Frequency (%) |
| term | 10000 | |
| short | 7295 | |
| long | 2705 | 13.5% |
Credit Score
Real number (ℝ≥0)
| Distinct | 272 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 2334 |
| Missing (%) | 22.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1077.99152 |
| Minimum | 585 |
|---|---|
| Maximum | 7510 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 585 |
|---|---|
| 5-th percentile | 662 |
| Q1 | 706 |
| median | 725 |
| Q3 | 741 |
| 95-th percentile | 6710 |
| Maximum | 7510 |
| Range | 6925 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 1477.467761 |
|---|---|
| Coefficient of variation (CV) | 1.370574567 |
| Kurtosis | 12.9134383 |
| Mean | 1077.99152 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 3.855418036 |
| Sum | 8644414 |
| Variance | 2182910.984 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 747 | 197 | 1.9% |
| 740 | 195 | 1.9% |
| 746 | 177 | 1.7% |
| 738 | 176 | 1.7% |
| 742 | 176 | 1.7% |
| 741 | 175 | 1.7% |
| 739 | 164 | 1.6% |
| 745 | 157 | 1.5% |
| 748 | 157 | 1.5% |
| 722 | 156 | 1.5% |
| Other values (262) | 6289 | |
| (Missing) | 2334 | 22.5% |
| Value | Count | Frequency (%) |
| 585 | 1 | < 0.1% |
| 586 | 1 | < 0.1% |
| 587 | 1 | < 0.1% |
| 588 | 1 | < 0.1% |
| 594 | 1 | < 0.1% |
| 595 | 3 | |
| 596 | 2 | |
| 597 | 1 | < 0.1% |
| 598 | 2 | |
| 599 | 2 |
| Value | Count | Frequency (%) |
| 7510 | 1 | < 0.1% |
| 7500 | 3 | < 0.1% |
| 7490 | 1 | < 0.1% |
| 7480 | 5 | |
| 7470 | 3 | < 0.1% |
| 7460 | 8 | |
| 7450 | 8 | |
| 7440 | 5 | |
| 7430 | 8 | |
| 7420 | 11 |
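The quartiles sit between 706 and 741, yet the 95th percentile is 6710 and the largest values (7510, 7500, 7490, ...) are all multiples of 10. One plausible explanation, which the report does not confirm, is that some scores were recorded with an extra trailing zero. The sketch below shows how such values could be rescaled; the 850 cut-off assumes the common 300 to 850 scoring range and `rescale_score` is a hypothetical helper.

```python
# Assumption: valid scores do not exceed 850.
MAX_PLAUSIBLE_SCORE = 850

def rescale_score(score):
    """Divide by 10 when a score looks like it carries an extra trailing zero."""
    if score is None:
        return None
    if score > MAX_PLAUSIBLE_SCORE and score % 10 == 0:
        return score // 10
    return score

# Values taken from the quantile and extreme-value tables above.
samples = [585, 725, 741, 6710, 7510, None]
print([rescale_score(s) for s in samples])
```

Values above the cut-off that are not multiples of 10 are left untouched so they can be inspected separately.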
Annual Income
Real number (ℝ≥0)
| Distinct | 7200 |
|---|---|
| Distinct (%) | 89.8% |
| Missing | 2334 |
| Missing (%) | 22.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1369106.04 |
| Minimum | 81092 |
|---|---|
| Maximum | 17815350 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 81092 |
|---|---|
| 5-th percentile | 520045.2 |
| Q1 | 848340.5 |
| median | 1168272 |
| Q3 | 1664390.5 |
| 95-th percentile | 2788525.5 |
| Maximum | 17815350 |
| Range | 17734258 |
| Interquartile range (IQR) | 816050 |
Descriptive statistics
| Standard deviation | 868755.7309 |
|---|---|
| Coefficient of variation (CV) | 0.6345423259 |
| Kurtosis | 42.22362967 |
| Mean | 1369106.04 |
| Median Absolute Deviation (MAD) | 384484 |
| Skewness | 4.158243202 |
| Sum | 1.097886133 × 10¹⁰ |
| Variance | 7.5473652 × 10¹¹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 955605 | 6 | 0.1% |
| 853803 | 6 | 0.1% |
| 1137606 | 5 | < 0.1% |
| 1158696 | 5 | < 0.1% |
| 1408185 | 4 | < 0.1% |
| 1243531 | 4 | < 0.1% |
| 1403492 | 4 | < 0.1% |
| 1137492 | 4 | < 0.1% |
| 1399217 | 4 | < 0.1% |
| 1120050 | 4 | < 0.1% |
| Other values (7190) | 7973 | |
| (Missing) | 2334 | 22.5% |
| Value | Count | Frequency (%) |
| 81092 | 1 | |
| 91485 | 1 | |
| 116603 | 1 | |
| 130150 | 1 | |
| 152114 | 1 | |
| 163305 | 1 | |
| 163894 | 1 | |
| 175845 | 1 | |
| 181982 | 1 | |
| 182438 | 1 |
| Value | Count | Frequency (%) |
| 17815350 | 1 | |
| 16244088 | 1 | |
| 12574770 | 1 | |
| 9589300 | 1 | |
| 9434450 | 1 | |
| 9346100 | 1 | |
| 9338880 | 1 | |
| 9336600 | 1 | |
| 8548290 | 1 | |
| 8526364 | 1 |
Years in current job
Categorical
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 780 |
| Missing (%) | 7.5% |
| Memory size | 629.0 KiB |
| 10+ years | 3085 |
|---|---|
| 2 years | 916 |
| 3 years | 866 |
| < 1 year | 795 |
| 5 years | 696 |
| Other values (6) | 3215 |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.659876737 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10+ years |
|---|---|
| 2nd row | 10+ years |
| 3rd row | 2 years |
| 4th row | 10+ years |
| 5th row | 10+ years |
Common Values
| Value | Count | Frequency (%) |
| 10+ years | 3085 | 29.8% |
| 2 years | 916 | 8.8% |
| 3 years | 866 | 8.4% |
| < 1 year | 795 | 7.7% |
| 5 years | 696 | 6.7% |
| 1 year | 648 | 6.3% |
| 4 years | 613 | 5.9% |
| 6 years | 566 | 5.5% |
| 7 years | 554 | 5.4% |
| 8 years | 472 | 4.6% |
| (Missing) | 780 | 7.5% |
Length (histogram)
Words
| Value | Count | Frequency (%) |
| years | 8130 | 40.8% |
| 10 | 3085 | 15.5% |
| 1 | 1443 | 7.2% |
| year | 1443 | 7.2% |
| 2 | 916 | 4.6% |
| 3 | 866 | 4.3% |
| < | 795 | 4.0% |
| 5 | 696 | 3.5% |
| 4 | 613 | 3.1% |
| 6 | 566 | 2.8% |
| Other values (3) | 1388 | 7.0% |
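The labels in this variable ("< 1 year", "2 years", "10+ years") are ordered but stored as text. A minimal sketch of one way to turn them into numbers follows. `parse_years` is a hypothetical helper, and mapping "< 1 year" to 0 and "10+ years" to 10 is a lossy assumption, since both labels cover a range.

```python
import re

def parse_years(label):
    """Convert labels such as '10+ years' or '< 1 year' to an integer."""
    if label is None:
        return None
    match = re.search(r"\d+", label)
    if match is None:
        return None
    years = int(match.group())
    # Assumption: "< 1 year" is treated as 0 full years.
    if label.strip().startswith("<"):
        return years - 1
    return years

for label in ["< 1 year", "1 year", "2 years", "10+ years", None]:
    print(repr(label), "->", parse_years(label))
```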
Home Ownership
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Memory size | 653.3 KiB |
| Home Mortgage | 4867 |
|---|---|
| Rent | 4203 |
| Own Home | 914 |
| HaveMortgage | 16 |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.7587 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Home Mortgage |
|---|---|
| 2nd row | Home Mortgage |
| 3rd row | Rent |
| 4th row | Rent |
| 5th row | Home Mortgage |
Common Values
| Value | Count | Frequency (%) |
| Home Mortgage | 4867 | 47.0% |
| Rent | 4203 | 40.6% |
| Own Home | 914 | 8.8% |
| HaveMortgage | 16 | 0.2% |
| (Missing) | 353 | 3.4% |
Length (histogram)
Pie chart
Words
| Value | Count | Frequency (%) |
| home | 5781 | 36.6% |
| mortgage | 4867 | 30.8% |
| rent | 4203 | 26.6% |
| own | 914 | 5.8% |
| havemortgage | 16 | 0.1% |
Purpose
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Memory size | 727.8 KiB |
| Debt Consolidation | 7878 |
|---|---|
| Home Improvements | 593 |
| other | 561 |
| Other | 308 |
| Business Loan | 163 |
| Other values (11) | 497 |
Length
| Max length | 20 |
|---|---|
| Median length | 18 |
| Mean length | 16.387 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Debt Consolidation |
|---|---|
| 2nd row | Debt Consolidation |
| 3rd row | Debt Consolidation |
| 4th row | Debt Consolidation |
| 5th row | Debt Consolidation |
Common Values
| Value | Count | Frequency (%) |
| Debt Consolidation | 7878 | 76.1% |
| Home Improvements | 593 | 5.7% |
| other | 561 | 5.4% |
| Other | 308 | 3.0% |
| Business Loan | 163 | 1.6% |
| Buy a Car | 142 | 1.4% |
| Medical Bills | 113 | 1.1% |
| Buy House | 70 | 0.7% |
| major_purchase | 52 | 0.5% |
| Take a Trip | 44 | 0.4% |
| Other values (6) | 76 | 0.7% |
| (Missing) | 353 | 3.4% |
Length (histogram)
Words
| Value | Count | Frequency (%) |
| debt | 7878 | 41.0% |
| consolidation | 7878 | 41.0% |
| other | 869 | 4.5% |
| home | 593 | 3.1% |
| improvements | 593 | 3.1% |
| buy | 212 | 1.1% |
| a | 186 | 1.0% |
| business | 163 | 0.8% |
| loan | 163 | 0.8% |
| car | 142 | 0.7% |
| Other values (13) | 526 | 2.7% |
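The Common Values table lists "other" (561) and "Other" (308) as separate categories, and mixes naming styles ("major_purchase" next to "Buy a Car"). The word counts above already treat the two spellings as one token (869). A minimal sketch of label normalization that merges them is shown below; `normalize_label` is a hypothetical helper, and whether the two spellings really mean the same thing is an assumption to confirm with the data owner.

```python
from collections import Counter

# Counts copied from the Common Values table (top ten categories only).
purpose_counts = {
    "Debt Consolidation": 7878,
    "Home Improvements": 593,
    "other": 561,
    "Other": 308,
    "Business Loan": 163,
    "Buy a Car": 142,
    "Medical Bills": 113,
    "Buy House": 70,
    "major_purchase": 52,
    "Take a Trip": 44,
}

def normalize_label(label):
    """Lower-case, turn underscores into spaces, collapse whitespace."""
    return " ".join(label.replace("_", " ").lower().split())

merged = Counter()
for label, count in purpose_counts.items():
    merged[normalize_label(label)] += count

for label, count in merged.most_common():
    print(f"{label}: {count}")
```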
Monthly Debt
Real number (ℝ≥0)
| Distinct | 9729 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18429.6717 |
| Minimum | 0 |
|---|---|
| Maximum | 229057.92 |
| Zeros | 8 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3585.585 |
| Q1 | 10202.8575 |
| median | 16052.055 |
| Q3 | 23881.3375 |
| 95-th percentile | 40596.6825 |
| Maximum | 229057.92 |
| Range | 229057.92 |
| Interquartile range (IQR) | 13678.48 |
Descriptive statistics
| Standard deviation | 12399.95619 |
|---|---|
| Coefficient of variation (CV) | 0.6728256691 |
| Kurtosis | 14.8449022 |
| Mean | 18429.6717 |
| Median Absolute Deviation (MAD) | 6614.185 |
| Skewness | 2.245736435 |
| Sum | 184296717 |
| Variance | 153758913.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8 | 0.1% |
| 13182.39 | 3 | < 0.1% |
| 15977.67 | 3 | < 0.1% |
| 14432.4 | 3 | < 0.1% |
| 12626.45 | 3 | < 0.1% |
| 13395.38 | 3 | < 0.1% |
| 12907.08 | 3 | < 0.1% |
| 18710.25 | 3 | < 0.1% |
| 15324.45 | 3 | < 0.1% |
| 14934.95 | 2 | < 0.1% |
| Other values (9719) | 9966 | |
| (Missing) | 353 | 3.4% |
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 113.62 | 1 | < 0.1% |
| 190.19 | 1 | < 0.1% |
| 191.52 | 1 | < 0.1% |
| 278.92 | 1 | < 0.1% |
| 281.96 | 1 | < 0.1% |
| 286.52 | 1 | < 0.1% |
| 288.42 | 1 | < 0.1% |
| 288.61 | 1 | < 0.1% |
| 292.22 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 229057.92 | 1 | |
| 143526.57 | 1 | |
| 139664.82 | 1 | |
| 114938.98 | 1 | |
| 111289.84 | 1 | |
| 109791.5 | 1 | |
| 103778.95 | 1 | |
| 101813.97 | 1 | |
| 97996.49 | 1 | |
| 97150.42 | 1 |
Years of Credit History
Real number (ℝ≥0)
| Distinct | 424 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.23593 |
| Minimum | 3.8 |
|---|---|
| Maximum | 62.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 3.8 |
|---|---|
| 5-th percentile | 8.9 |
| Q1 | 13.6 |
| median | 17 |
| Q3 | 21.7 |
| 95-th percentile | 31.7 |
| Maximum | 62.5 |
| Range | 58.7 |
| Interquartile range (IQR) | 8.1 |
Descriptive statistics
| Standard deviation | 7.018355774 |
|---|---|
| Coefficient of variation (CV) | 0.3848641541 |
| Kurtosis | 1.748104942 |
| Mean | 18.23593 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.071869791 |
| Sum | 182359.3 |
| Variance | 49.25731777 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 141 | 1.4% |
| 14 | 130 | 1.3% |
| 17 | 127 | 1.2% |
| 15.4 | 125 | 1.2% |
| 16.5 | 121 | 1.2% |
| 13 | 116 | 1.1% |
| 15 | 113 | 1.1% |
| 14.5 | 104 | 1.0% |
| 18.5 | 96 | 0.9% |
| 12 | 92 | 0.9% |
| Other values (414) | 8835 | |
| (Missing) | 353 | 3.4% |
| Value | Count | Frequency (%) |
| 3.8 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 4.1 | 1 | < 0.1% |
| 4.3 | 2 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 4.7 | 3 | |
| 4.8 | 4 | |
| 4.9 | 2 | < 0.1% |
| 5 | 7 | |
| 5.1 | 4 |
| Value | Count | Frequency (%) |
| 62.5 | 1 | |
| 57.5 | 1 | |
| 52.5 | 1 | |
| 51.8 | 1 | |
| 50.9 | 1 | |
| 50 | 1 | |
| 49.9 | 1 | |
| 49.4 | 1 | |
| 49.2 | 1 | |
| 49 | 2 |
Months since last delinquent
Real number (ℝ≥0)
| Distinct | 89 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 5659 |
| Missing (%) | 54.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.96463571 |
| Minimum | 0 |
|---|---|
| Maximum | 131 |
| Zeros | 17 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 17 |
| median | 32 |
| Q3 | 50 |
| 95-th percentile | 75 |
| Maximum | 131 |
| Range | 131 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 21.64029066 |
|---|---|
| Coefficient of variation (CV) | 0.6189193801 |
| Kurtosis | -0.7471571255 |
| Mean | 34.96463571 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 0.4462521184 |
| Sum | 164124 |
| Variance | 468.3021797 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 109 | 1.1% |
| 17 | 103 | 1.0% |
| 9 | 94 | 0.9% |
| 12 | 92 | 0.9% |
| 23 | 90 | 0.9% |
| 8 | 87 | 0.8% |
| 19 | 86 | 0.8% |
| 38 | 86 | 0.8% |
| 6 | 85 | 0.8% |
| 11 | 85 | 0.8% |
| Other values (79) | 3777 | |
| (Missing) | 5659 | 54.7% |
| Value | Count | Frequency (%) |
| 0 | 17 | 0.2% |
| 1 | 19 | 0.2% |
| 2 | 41 | |
| 3 | 46 | |
| 4 | 44 | |
| 5 | 63 | |
| 6 | 85 | |
| 7 | 70 | |
| 8 | 87 | |
| 9 | 94 |
| Value | Count | Frequency (%) |
| 131 | 1 | < 0.1% |
| 107 | 1 | < 0.1% |
| 88 | 1 | < 0.1% |
| 87 | 2 | < 0.1% |
| 86 | 1 | < 0.1% |
| 83 | 2 | < 0.1% |
| 82 | 16 | |
| 81 | 29 | |
| 80 | 34 | |
| 79 | 37 |
Number of Open Accounts
Real number (ℝ≥0)
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.0841 |
| Minimum | 1 |
|---|---|
| Maximum | 55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 7 |
| median | 10 |
| Q3 | 14 |
| 95-th percentile | 20 |
| Maximum | 55 |
| Range | 54 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 5.023380398 |
|---|---|
| Coefficient of variation (CV) | 0.4532059796 |
| Kurtosis | 3.183041041 |
| Mean | 11.0841 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.237947558 |
| Sum | 110841 |
| Variance | 25.23435063 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 968 | 9.3% |
| 8 | 913 | 8.8% |
| 9 | 898 | 8.7% |
| 7 | 883 | 8.5% |
| 11 | 846 | 8.2% |
| 6 | 711 | 6.9% |
| 12 | 681 | 6.6% |
| 13 | 546 | 5.3% |
| 14 | 518 | 5.0% |
| 5 | 455 | 4.4% |
| Other values (35) | 2581 |
| Value | Count | Frequency (%) |
| 1 | 2 | < 0.1% |
| 2 | 35 | 0.3% |
| 3 | 125 | 1.2% |
| 4 | 291 | 2.8% |
| 5 | 455 | |
| 6 | 711 | |
| 7 | 883 | |
| 8 | 913 | |
| 9 | 898 | |
| 10 | 968 |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 47 | 2 | |
| 43 | 2 | |
| 42 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 39 | 2 | |
| 38 | 1 | < 0.1% |
| 37 | 2 | |
| 36 | 4 |
Number of Credit Problems
Real number (ℝ≥0)
High correlation, Missing, Zeros
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1655 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 8653 |
| Zeros (%) | 83.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5009339712 |
|---|---|
| Coefficient of variation (CV) | 3.026791367 |
| Kurtosis | 62.05829572 |
| Mean | 0.1655 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.704018402 |
| Sum | 1655 |
| Variance | 0.2509348435 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8653 | 83.6% |
| 1 | 1158 | 11.2% |
| 2 | 127 | 1.2% |
| 3 | 38 | 0.4% |
| 4 | 10 | 0.1% |
| 5 | 8 | 0.1% |
| 9 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| (Missing) | 353 | 3.4% |
| Value | Count | Frequency (%) |
| 0 | 8653 | 83.6% |
| 1 | 1158 | 11.2% |
| 2 | 127 | 1.2% |
| 3 | 38 | 0.4% |
| 4 | 10 | 0.1% |
| 5 | 8 | 0.1% |
| 6 | 2 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 9 | 3 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 8 | 0.1% |
| 4 | 10 | 0.1% |
| 3 | 38 | 0.4% |
| 2 | 127 | 1.2% |
| 1 | 1158 | 11.2% |
| 0 | 8653 | 83.6% |
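Because this variable takes only nine distinct values, the reported sum, mean, variance and zero share can all be recomputed from the frequency table alone. The sketch below does that with the counts listed above. Using the sample variance (n − 1 denominator) reproduces the reported figure.

```python
# Value -> count, copied from the frequency table.
value_counts = {0: 8653, 1: 1158, 2: 127, 3: 38, 4: 10, 5: 8, 6: 2, 9: 3, 10: 1}
n_observations = 10353  # total rows, including missing values

n = sum(value_counts.values())
total = sum(value * count for value, count in value_counts.items())
mean = total / n

# Sample variance (n - 1 in the denominator).
sum_sq_dev = sum(count * (value - mean) ** 2 for value, count in value_counts.items())
variance = sum_sq_dev / (n - 1)

# The report expresses the zero share relative to all rows.
zero_pct = 100 * value_counts[0] / n_observations

print(f"n = {n}, sum = {total}, mean = {mean}")
print(f"variance = {variance:.10f}")
print(f"zeros = {zero_pct:.1f}%")
```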
Current Credit Balance
Real number (ℝ≥0)
| Distinct | 8430 |
|---|---|
| Distinct (%) | 84.3% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 290730.0637 |
| Minimum | 0 |
|---|---|
| Maximum | 16237438 |
| Zeros | 55 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 30133.05 |
| Q1 | 108651.5 |
| median | 207518 |
| Q3 | 362463 |
| 95-th percentile | 755940.65 |
| Maximum | 16237438 |
| Range | 16237438 |
| Interquartile range (IQR) | 253811.5 |
Descriptive statistics
| Standard deviation | 388168.6782 |
|---|---|
| Coefficient of variation (CV) | 1.335151492 |
| Kurtosis | 458.0452274 |
| Mean | 290730.0637 |
| Median Absolute Deviation (MAD) | 115681.5 |
| Skewness | 14.87107845 |
| Sum | 2907300637 |
| Variance | 1.506749227 × 10¹¹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 55 | 0.5% |
| 106818 | 5 | < 0.1% |
| 171836 | 5 | < 0.1% |
| 208297 | 4 | < 0.1% |
| 144210 | 4 | < 0.1% |
| 76304 | 4 | < 0.1% |
| 80845 | 4 | < 0.1% |
| 229406 | 4 | < 0.1% |
| 151563 | 4 | < 0.1% |
| 252111 | 4 | < 0.1% |
| Other values (8420) | 9907 | |
| (Missing) | 353 | 3.4% |
| Value | Count | Frequency (%) |
| 0 | 55 | |
| 38 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 114 | 3 | < 0.1% |
| 209 | 1 | < 0.1% |
| 247 | 1 | < 0.1% |
| 342 | 1 | < 0.1% |
| 361 | 1 | < 0.1% |
| 380 | 1 | < 0.1% |
| 418 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 16237438 | 1 | |
| 11796435 | 1 | |
| 11576491 | 1 | |
| 7111073 | 1 | |
| 5384106 | 1 | |
| 4882791 | 1 | |
| 4778671 | 1 | |
| 4252143 | 1 | |
| 4091042 | 1 | |
| 3964350 | 1 |
Maximum Open Credit
Real number (ℝ≥0)
High correlation, Missing, Skewed
| Distinct | 9064 |
|---|---|
| Distinct (%) | 90.6% |
| Missing | 353 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 687130.7806 |
| Minimum | 0 |
|---|---|
| Maximum | 145907344 |
| Zeros | 62 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 111362.9 |
| Q1 | 270600 |
| median | 462605 |
| Q3 | 786115 |
| 95-th percentile | 1641385.9 |
| Maximum | 145907344 |
| Range | 145907344 |
| Interquartile range (IQR) | 515515 |
Descriptive statistics
| Standard deviation | 1861394.4 |
|---|---|
| Coefficient of variation (CV) | 2.708937589 |
| Kurtosis | 3768.372941 |
| Mean | 687130.7806 |
| Median Absolute Deviation (MAD) | 229449 |
| Skewness | 51.4146651 |
| Sum | 6871307806 |
| Variance | 3.464789114 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 62 | 0.6% |
| 594836 | 4 | < 0.1% |
| 452232 | 4 | < 0.1% |
| 225258 | 4 | < 0.1% |
| 312114 | 4 | < 0.1% |
| 217602 | 3 | < 0.1% |
| 622600 | 3 | < 0.1% |
| 422950 | 3 | < 0.1% |
| 316140 | 3 | < 0.1% |
| 479952 | 3 | < 0.1% |
| Other values (9054) | 9907 | |
| (Missing) | 353 | 3.4% |
| Value | Count | Frequency (%) |
| 0 | 62 | |
| 5390 | 1 | < 0.1% |
| 6644 | 1 | < 0.1% |
| 8712 | 1 | < 0.1% |
| 8866 | 1 | < 0.1% |
| 10846 | 1 | < 0.1% |
| 11066 | 2 | < 0.1% |
| 11264 | 1 | < 0.1% |
| 12958 | 1 | < 0.1% |
| 13134 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 145907344 | 1 | < 0.1% |
| 45042030 | 1 | < 0.1% |
| 37527424 | 1 | < 0.1% |
| 25148882 | 1 | < 0.1% |
| 24582712 | 1 | < 0.1% |
| 22454278 | 1 | < 0.1% |
| 20172680 | 1 | < 0.1% |
| 20108220 | 1 | < 0.1% |
| 19194758 | 1 | < 0.1% |
| 18287852 | 1 | < 0.1% |
Bankruptcies
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 375 |
| Missing (%) | 3.6% |
| Memory size | 599.4 KiB |
| Value | Count |
|---|---|
| 0.0 | 8895 |
| 1.0 | 1022 |
| 2.0 | 46 |
| 3.0 | 14 |
| 5.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 8895 | 85.9% |
| 1.0 | 1022 | 9.9% |
| 2.0 | 46 | 0.4% |
| 3.0 | 14 | 0.1% |
| 5.0 | 1 | < 0.1% |
| (Missing) | 375 | 3.6% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 8895 | 89.1% |
| 1.0 | 1022 | 10.2% |
| 2.0 | 46 | 0.5% |
| 3.0 | 14 | 0.1% |
| 5.0 | 1 | < 0.1% |
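The Common Values and Pie chart tables list the same counts but slightly different percentages (for example 9.9% versus 10.2% for the value 1.0). This is consistent with the first table dividing by all rows and the second dividing by non-missing rows only. That reading is an inference from the numbers, not something the report states. A minimal sketch, with illustrative names:

```python
# Counts copied from the Bankruptcies tables in this report.
n_observations = 10353   # "Number of observations" in the dataset overview
n_missing = 375          # "(Missing)" row of the Common Values table
bankruptcies_counts = {"0.0": 8895, "1.0": 1022, "2.0": 46, "3.0": 14, "5.0": 1}

# Rows that actually carry a value.
n_valid = sum(bankruptcies_counts.values())


def percentages(counts, denominator):
    """Return each count as a percentage of `denominator`, rounded to 0.1."""
    return {
        value: round(100 * count / denominator, 1)
        for value, count in counts.items()
    }


# Assumed denominators: all rows for "Common Values", valid rows for "Pie chart".
share_of_all_rows = percentages(bankruptcies_counts, n_observations)
share_of_valid_rows = percentages(bankruptcies_counts, n_valid)

print("share of all rows:  ", share_of_all_rows)
print("share of valid rows:", share_of_valid_rows)
```

The smallest categories fall below 0.1% and are shown in the report as "< 0.1%" instead of a rounded number.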
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | | |
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per block
Tax Liens
Real number (ℝ≥0)
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 354 |
| Missing (%) | 3.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.03080308031 |
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 9810 |
| Zeros (%) | 94.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 81.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2882149869 |
|---|---|
| Coefficient of variation (CV) | 9.356693683 |
| Kurtosis | 371.2997812 |
| Mean | 0.03080308031 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.28358748 |
| Sum | 308 |
| Variance | 0.0830678787 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9810 | 94.8% |
| 1 | 131 | 1.3% |
| 2 | 31 | 0.3% |
| 3 | 13 | 0.1% |
| 4 | 8 | 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| (Missing) | 354 | 3.4% |
| Value | Count | Frequency (%) |
| 0 | 9810 | 94.8% |
| 1 | 131 | 1.3% |
| 2 | 31 | 0.3% |
| 3 | 13 | 0.1% |
| 4 | 8 | 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 8 | 0.1% |
| 3 | 13 | 0.1% |
| 2 | 31 | 0.3% |
| 1 | 131 | 1.3% |
| 0 | 9810 | 94.8% |
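Because this variable takes only eight distinct values, its summary statistics can be reconstructed from the value counts alone. The sketch below does so, assuming the sample variance (n − 1 in the denominator), which is the pandas default. The function and variable names are illustrative.

```python
from math import sqrt

# Value counts copied from the Common Values table above (missing rows excluded).
tax_liens_counts = {0: 9810, 1: 131, 2: 31, 3: 13, 4: 8, 5: 2, 8: 2, 9: 2}


def summarize(counts):
    """Compute basic statistics from a {value: count} mapping."""
    n = sum(counts.values())
    total = sum(value * count for value, count in counts.items())
    mean = total / n
    # Sum of squared deviations, weighted by how often each value occurs.
    squared_dev = sum(count * (value - mean) ** 2 for value, count in counts.items())
    variance = squared_dev / (n - 1)   # assumption: sample variance (ddof=1)
    std = sqrt(variance)
    return {
        "n": n,
        "sum": total,
        "mean": mean,
        "variance": variance,
        "std": std,
        "cv": std / mean,
    }


stats = summarize(tax_liens_counts)
for name, value in stats.items():
    print(f"{name}: {value}")
```

The results can be compared against the Sum, Mean, Variance, Standard deviation, and Coefficient of variation entries listed for this variable.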
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
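The definition above can be sketched in plain Python as Pearson's r applied to ranks. This is an illustrative sketch and not the report's implementation. The helper names are hypothetical, and ties receive the average of the ranks they span, which is a common convention assumed here.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship (made-up data): y = x cubed.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print("Spearman:", spearman_rho(x, y))
print("Pearson: ", pearson_r(x, y))
```

On this toy data the ranks of x and y coincide, so ρ reaches its maximum while Pearson's r stays below it.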
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
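A minimal sketch of this calculation, including a check of the invariance property, is shown below. The input values are the Annual Income and Monthly Debt entries from the nine complete rows of the "First rows" table. The function name and the particular rescaling are illustrative assumptions.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations.

    The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    so plain sums of deviations are used.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


# Rows 0-6, 8 and 9 of the "First rows" table (row 7 has no Annual Income).
annual_income = [2074116.0, 1919190.0, 871112.0, 780083.0, 1761148.0,
                 760380.0, 1783606.0, 1068617.0, 2031518.0]
monthly_debt = [42000.83, 36624.40, 8391.73, 16771.87, 39478.77,
                6526.69, 36563.98, 18745.21, 12443.10]

r = pearson_r(annual_income, monthly_debt)

# Shift and rescale each variable separately (positive scale factors).
income_in_thousands = [v / 1000 for v in annual_income]
debt_shifted = [2 * v + 500 for v in monthly_debt]
r_rescaled = pearson_r(income_in_thousands, debt_shifted)

print("r on raw values:     ", r)
print("r on rescaled values:", r_rescaled)
```

Nine rows are far too few to reproduce the report's correlation value; the example only illustrates the formula and that r is unchanged by the rescaling.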
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
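The pair-counting definition above corresponds to the variant usually called τ-a, sketched below with a hypothetical function name. Library implementations often use a tie-adjusted variant (τ-b), so results may differ from this sketch when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1     # both variables move in the same direction
        elif direction < 0:
            discordant += 1     # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts toward neither group
    n = len(x)
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs


# Made-up data: one swapped pair out of six possible pairs.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print("tau-a:", tau)
```

Here five pairs are concordant and one is discordant, giving (5 − 1) / 6.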
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.
First rows
| | Loan ID | Customer ID | Current Loan Amount | Term | Credit Score | Annual Income | Years in current job | Home Ownership | Purpose | Monthly Debt | Years of Credit History | Months since last delinquent | Number of Open Accounts | Number of Credit Problems | Current Credit Balance | Maximum Open Credit | Bankruptcies | Tax Liens |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | f738779f-c726-40dc-92cf-689d73af533d | ded0b3c3-6bf4-4091-8726-47039f2c1b90 | 611314.0 | Short Term | 747.0 | 2074116.0 | 10+ years | Home Mortgage | Debt Consolidation | 42000.83 | 21.8 | NaN | 9.0 | 0.0 | 621908.0 | 1058970.0 | 0.0 | 0.0 |
| 1 | 6dcc0947-164d-476c-a1de-3ae7283dde0a | 1630e6e3-34e3-461a-8fda-09297d3140c8 | 266662.0 | Short Term | 734.0 | 1919190.0 | 10+ years | Home Mortgage | Debt Consolidation | 36624.40 | 19.4 | NaN | 11.0 | 0.0 | 679573.0 | 904442.0 | 0.0 | 0.0 |
| 2 | f7744d01-894b-49c3-8777-fc6431a2cff1 | 2c60938b-ad2b-4702-804d-eeca43949c52 | 153494.0 | Short Term | 709.0 | 871112.0 | 2 years | Rent | Debt Consolidation | 8391.73 | 12.5 | 10.0 | 10.0 | 0.0 | 38532.0 | 388036.0 | 0.0 | 0.0 |
| 3 | 83721ffb-b99a-4a0f-aea5-ef472a138b41 | 12116614-2f3c-4d16-ad34-d92883718806 | 176242.0 | Short Term | 727.0 | 780083.0 | 10+ years | Rent | Debt Consolidation | 16771.87 | 16.5 | 27.0 | 16.0 | 1.0 | 156940.0 | 531322.0 | 1.0 | 0.0 |
| 4 | 08f3789f-5714-4b10-929d-e1527ab5e5a3 | 39888105-fd5f-4023-860a-30a3e6f5ccb7 | 321992.0 | Short Term | 744.0 | 1761148.0 | 10+ years | Home Mortgage | Debt Consolidation | 39478.77 | 26.0 | 44.0 | 14.0 | 0.0 | 359765.0 | 468072.0 | 0.0 | 0.0 |
| 5 | a4957169-d809-44cc-847b-975400bc8d11 | 6878d414-6a22-4712-ae43-9b3f798e463a | 202928.0 | Short Term | 741.0 | 760380.0 | 1 year | Rent | Debt Consolidation | 6526.69 | 13.8 | NaN | 6.0 | 0.0 | 258647.0 | 476872.0 | 0.0 | 0.0 |
| 6 | 43467302-94fe-494b-b52f-3fd891fea71c | 48113a98-a4a0-4956-b57d-f0ce344826fb | 621786.0 | Long Term | 733.0 | 1783606.0 | 10+ years | Home Mortgage | Debt Consolidation | 36563.98 | 15.3 | NaN | 42.0 | 0.0 | 281599.0 | 1449162.0 | 0.0 | 0.0 |
| 7 | 930c7cb3-6086-434a-9547-3ed41c181552 | 19941661-98e2-4800-93c9-a0e92057c813 | 266794.0 | Long Term | NaN | NaN | < 1 year | Own Home | Debt Consolidation | 12336.89 | 5.8 | NaN | 9.0 | 0.0 | 233206.0 | 342232.0 | 0.0 | 0.0 |
| 8 | 0b2f1b66-741e-4e37-a929-99926cdc9e9a | 6a1adeda-079b-49e5-ac7c-91828f2806a0 | 202466.0 | Short Term | 736.0 | 1068617.0 | 5 years | Rent | Debt Consolidation | 18745.21 | 20.5 | NaN | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 9 | d08f3a5e-93df-40e7-bdd8-cba59180bddf | 4080a828-a61a-4f04-a627-397f4319500c | 266288.0 | Long Term | 683.0 | 2031518.0 | 2 years | Rent | Debt Consolidation | 12443.10 | 24.4 | 56.0 | 8.0 | 2.0 | 31445.0 | 251130.0 | 2.0 | 0.0 |
Last rows
| | Loan ID | Customer ID | Current Loan Amount | Term | Credit Score | Annual Income | Years in current job | Home Ownership | Purpose | Monthly Debt | Years of Credit History | Months since last delinquent | Number of Open Accounts | Number of Credit Problems | Current Credit Balance | Maximum Open Credit | Bankruptcies | Tax Liens |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10343 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10344 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10345 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10346 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10347 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10348 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10349 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10350 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10351 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 10352 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |